Dataset statistics
| Number of variables | 25 |
|---|---|
| Number of observations | 8399 |
| Missing cells | 966 |
| Missing cells (%) | 0.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.6 MiB |
| Average record size in memory | 200.0 B |
Variable types
| Text | 3 |
|---|---|
| Numeric | 11 |
| Categorical | 9 |
| DateTime | 2 |
number_of_records has constant value "" | Constant |
order_id is highly overall correlated with row_id | High correlation |
row_id is highly overall correlated with order_id | High correlation |
sales is highly overall correlated with shipping_cost and 1 other fields | High correlation |
shipping_cost is highly overall correlated with sales and 2 other fields | High correlation |
unit_price is highly overall correlated with sales and 1 other fields | High correlation |
zip_code is highly overall correlated with region and 1 other fields | High correlation |
product_category is highly overall correlated with product_sub_category | High correlation |
product_container is highly overall correlated with product_sub_category and 1 other fields | High correlation |
product_sub_category is highly overall correlated with product_category and 2 other fields | High correlation |
region is highly overall correlated with zip_code and 1 other fields | High correlation |
ship_mode is highly overall correlated with shipping_cost and 2 other fields | High correlation |
state is highly overall correlated with zip_code and 1 other fields | High correlation |
customer_age has 903 (10.8%) missing values | Missing |
row_id is uniformly distributed | Uniform |
row_id has unique values | Unique |
discount has 756 (9.0%) zeros | Zeros |
Reproduction
| Analysis started | 2023-10-22 10:05:24.206409 |
|---|---|
| Analysis finished | 2023-10-22 10:05:44.159187 |
| Duration | 19.95 seconds |
| Software version | ydata-profiling vv4.6.0 |
| Download configuration | config.json |
city
Text
| Distinct | 1421 |
|---|---|
| Distinct (%) | 16.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.7 KiB |
Length
| Max length | 19 |
|---|---|
| Median length | 16 |
| Mean length | 9.1384689 |
| Min length | 3 |
Characters and Unicode
| Total characters | 76754 |
|---|---|
| Distinct characters | 52 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 151 ? |
|---|---|
| Unique (%) | 1.8% |
Sample
| 1st row | McKeesport |
|---|---|
| 2nd row | Bowie |
| 3rd row | Napa |
| 4th row | Montebello |
| 5th row | Napa |
| Value | Count | Frequency (%) |
| city | 266 | 2.4% |
| park | 165 | 1.5% |
| beach | 116 | 1.0% |
| west | 112 | 1.0% |
| heights | 107 | 1.0% |
| san | 97 | 0.9% |
| north | 89 | 0.8% |
| lake | 88 | 0.8% |
| saint | 85 | 0.8% |
| hills | 73 | 0.7% |
| Other values (1377) | 9992 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 7209 | 9.4% |
| a | 6797 | 8.9% |
| n | 5679 | 7.4% |
| o | 5628 | 7.3% |
| l | 5073 | 6.6% |
| r | 4939 | 6.4% |
| i | 4617 | 6.0% |
| t | 3962 | 5.2% |
| s | 3383 | 4.4% |
| 2791 | 3.6% | |
| Other values (42) | 26676 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 62773 | |
| Uppercase Letter | 11190 | 14.6% |
| Space Separator | 2791 | 3.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 7209 | |
| a | 6797 | |
| n | 5679 | |
| o | 5628 | |
| l | 5073 | 8.1% |
| r | 4939 | 7.9% |
| i | 4617 | 7.4% |
| t | 3962 | 6.3% |
| s | 3383 | 5.4% |
| d | 1997 | 3.2% |
| Other values (16) | 13489 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1214 | 10.8% |
| S | 1001 | 8.9% |
| P | 952 | 8.5% |
| M | 851 | 7.6% |
| B | 851 | 7.6% |
| H | 707 | 6.3% |
| L | 688 | 6.1% |
| W | 555 | 5.0% |
| R | 548 | 4.9% |
| A | 472 | 4.2% |
| Other values (15) | 3351 |
Space Separator
| Value | Count | Frequency (%) |
| 2791 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 73963 | |
| Common | 2791 | 3.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 7209 | 9.7% |
| a | 6797 | 9.2% |
| n | 5679 | 7.7% |
| o | 5628 | 7.6% |
| l | 5073 | 6.9% |
| r | 4939 | 6.7% |
| i | 4617 | 6.2% |
| t | 3962 | 5.4% |
| s | 3383 | 4.6% |
| d | 1997 | 2.7% |
| Other values (41) | 24679 |
Common
| Value | Count | Frequency (%) |
| 2791 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 76754 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 7209 | 9.4% |
| a | 6797 | 8.9% |
| n | 5679 | 7.4% |
| o | 5628 | 7.3% |
| l | 5073 | 6.6% |
| r | 4939 | 6.4% |
| i | 4617 | 6.0% |
| t | 3962 | 5.2% |
| s | 3383 | 4.4% |
| 2791 | 3.6% | |
| Other values (42) | 26676 |
customer_age
Real number (ℝ)
MISSING 
| Distinct | 48 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 903 |
| Missing (%) | 10.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 54.542823 |
| Minimum | 41 |
|---|---|
| Maximum | 95 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 65.7 KiB |
Quantile statistics
| Minimum | 41 |
|---|---|
| 5-th percentile | 42 |
| Q1 | 47 |
| median | 53 |
| Q3 | 61 |
| 95-th percentile | 71 |
| Maximum | 95 |
| Range | 54 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 9.5194352 |
|---|---|
| Coefficient of variation (CV) | 0.1745314 |
| Kurtosis | -0.072222184 |
| Mean | 54.542823 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.6106657 |
| Sum | 408853 |
| Variance | 90.619647 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 46 | 413 | 4.9% |
| 41 | 356 | 4.2% |
| 47 | 330 | 3.9% |
| 51 | 329 | 3.9% |
| 56 | 310 | 3.7% |
| 44 | 304 | 3.6% |
| 48 | 299 | 3.6% |
| 55 | 293 | 3.5% |
| 50 | 281 | 3.3% |
| 42 | 276 | 3.3% |
| Other values (38) | 4305 | |
| (Missing) | 903 | 10.8% |
| Value | Count | Frequency (%) |
| 41 | 356 | |
| 42 | 276 | |
| 43 | 263 | |
| 44 | 304 | |
| 45 | 252 | |
| 46 | 413 | |
| 47 | 330 | |
| 48 | 299 | |
| 49 | 203 | |
| 50 | 281 |
| Value | Count | Frequency (%) |
| 95 | 8 | 0.1% |
| 93 | 1 | < 0.1% |
| 88 | 7 | 0.1% |
| 86 | 19 | |
| 85 | 2 | < 0.1% |
| 84 | 1 | < 0.1% |
| 82 | 8 | 0.1% |
| 81 | 11 | |
| 80 | 3 | < 0.1% |
| 79 | 21 |
customer_name
Text
| Distinct | 795 |
|---|---|
| Distinct (%) | 9.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.7 KiB |
Length
| Max length | 22 |
|---|---|
| Median length | 19 |
| Mean length | 12.867127 |
| Min length | 7 |
Characters and Unicode
| Total characters | 108071 |
|---|---|
| Distinct characters | 53 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Jessica Myrick |
|---|---|
| 2nd row | Matt Collister |
| 3rd row | Alan Schoenberger |
| 4th row | Elizabeth Moffitt |
| 5th row | Alan Schoenberger |
| Value | Count | Frequency (%) |
| michael | 105 | 0.6% |
| john | 93 | 0.6% |
| brown | 93 | 0.6% |
| liz | 87 | 0.5% |
| michelle | 86 | 0.5% |
| jones | 86 | 0.5% |
| patrick | 83 | 0.5% |
| bill | 80 | 0.5% |
| alan | 77 | 0.5% |
| price | 75 | 0.4% |
| Other values (895) | 15961 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 10075 | 9.3% |
| e | 9451 | 8.7% |
| n | 8461 | 7.8% |
| 8427 | 7.8% | |
| r | 7919 | 7.3% |
| i | 6569 | 6.1% |
| l | 5720 | 5.3% |
| o | 5302 | 4.9% |
| t | 4465 | 4.1% |
| s | 3528 | 3.3% |
| Other values (43) | 38154 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 82315 | |
| Uppercase Letter | 17202 | 15.9% |
| Space Separator | 8427 | 7.8% |
| Other Punctuation | 127 | 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1563 | 9.1% |
| B | 1510 | 8.8% |
| S | 1487 | 8.6% |
| M | 1487 | 8.6% |
| D | 1121 | 6.5% |
| J | 1009 | 5.9% |
| A | 988 | 5.7% |
| P | 880 | 5.1% |
| H | 796 | 4.6% |
| T | 784 | 4.6% |
| Other values (16) | 5577 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 10075 | |
| e | 9451 | |
| n | 8461 | |
| r | 7919 | |
| i | 6569 | 8.0% |
| l | 5720 | 6.9% |
| o | 5302 | 6.4% |
| t | 4465 | 5.4% |
| s | 3528 | 4.3% |
| h | 3269 | 4.0% |
| Other values (15) | 17556 |
Space Separator
| Value | Count | Frequency (%) |
| 8427 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 127 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 99517 | |
| Common | 8554 | 7.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 10075 | 10.1% |
| e | 9451 | 9.5% |
| n | 8461 | 8.5% |
| r | 7919 | 8.0% |
| i | 6569 | 6.6% |
| l | 5720 | 5.7% |
| o | 5302 | 5.3% |
| t | 4465 | 4.5% |
| s | 3528 | 3.5% |
| h | 3269 | 3.3% |
| Other values (41) | 34758 |
Common
| Value | Count | Frequency (%) |
| 8427 | ||
| ' | 127 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 108071 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 10075 | 9.3% |
| e | 9451 | 8.7% |
| n | 8461 | 7.8% |
| 8427 | 7.8% | |
| r | 7919 | 7.3% |
| i | 6569 | 6.1% |
| l | 5720 | 5.3% |
| o | 5302 | 4.9% |
| t | 4465 | 4.1% |
| s | 3528 | 3.3% |
| Other values (43) | 38154 |
customer_segment
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.7 KiB |
| Corporate | |
|---|---|
| Home Office | |
| Consumer | |
| Small Business |
Length
| Max length | 14 |
|---|---|
| Median length | 11 |
| Mean length | 10.265032 |
| Min length | 8 |
Characters and Unicode
| Total characters | 86216 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Small Business |
|---|---|
| 2nd row | Home Office |
| 3rd row | Corporate |
| 4th row | Consumer |
| 5th row | Corporate |
Common Values
| Value | Count | Frequency (%) |
| Corporate | 3076 | |
| Home Office | 2032 | |
| Consumer | 1649 | |
| Small Business | 1642 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| corporate | 3076 | |
| home | 2032 | |
| office | 2032 | |
| consumer | 1649 | |
| small | 1642 | |
| business | 1642 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 10431 | |
| o | 9833 | 11.4% |
| r | 7801 | 9.0% |
| s | 6575 | 7.6% |
| m | 5323 | 6.2% |
| C | 4725 | 5.5% |
| a | 4718 | 5.5% |
| f | 4064 | 4.7% |
| 3674 | 4.3% | |
| i | 3674 | 4.3% |
| Other values (10) | 25398 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 70469 | |
| Uppercase Letter | 12073 | 14.0% |
| Space Separator | 3674 | 4.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 10431 | |
| o | 9833 | |
| r | 7801 | |
| s | 6575 | |
| m | 5323 | |
| a | 4718 | |
| f | 4064 | 5.8% |
| i | 3674 | 5.2% |
| u | 3291 | 4.7% |
| n | 3291 | 4.7% |
| Other values (4) | 11468 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 4725 | |
| O | 2032 | |
| H | 2032 | |
| S | 1642 | 13.6% |
| B | 1642 | 13.6% |
Space Separator
| Value | Count | Frequency (%) |
| 3674 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 82542 | |
| Common | 3674 | 4.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 10431 | |
| o | 9833 | |
| r | 7801 | 9.5% |
| s | 6575 | 8.0% |
| m | 5323 | 6.4% |
| C | 4725 | 5.7% |
| a | 4718 | 5.7% |
| f | 4064 | 4.9% |
| i | 3674 | 4.5% |
| u | 3291 | 4.0% |
| Other values (9) | 22107 |
Common
| Value | Count | Frequency (%) |
| 3674 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 86216 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 10431 | |
| o | 9833 | 11.4% |
| r | 7801 | 9.0% |
| s | 6575 | 7.6% |
| m | 5323 | 6.2% |
| C | 4725 | 5.5% |
| a | 4718 | 5.5% |
| f | 4064 | 4.7% |
| 3674 | 4.3% | |
| i | 3674 | 4.3% |
| Other values (10) | 25398 |
discount
Real number (ℝ)
ZEROS 
| Distinct | 16 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.049671389 |
| Minimum | 0 |
|---|---|
| Maximum | 0.25 |
| Zeros | 756 |
| Zeros (%) | 9.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 65.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.02 |
| median | 0.05 |
| Q3 | 0.08 |
| 95-th percentile | 0.1 |
| Maximum | 0.25 |
| Range | 0.25 |
| Interquartile range (IQR) | 0.06 |
Descriptive statistics
| Standard deviation | 0.03182302 |
|---|---|
| Coefficient of variation (CV) | 0.64067102 |
| Kurtosis | -0.95941106 |
| Mean | 0.049671389 |
| Median Absolute Deviation (MAD) | 0.03 |
| Skewness | 0.073916963 |
| Sum | 417.19 |
| Variance | 0.0010127046 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.01 | 806 | |
| 0.05 | 786 | |
| 0.03 | 779 | |
| 0.09 | 778 | |
| 0.04 | 770 | |
| 0.08 | 765 | |
| 0.02 | 765 | |
| 0 | 756 | |
| 0.1 | 745 | |
| 0.06 | 734 | |
| Other values (6) | 715 |
| Value | Count | Frequency (%) |
| 0 | 756 | |
| 0.01 | 806 | |
| 0.02 | 765 | |
| 0.03 | 779 | |
| 0.04 | 770 | |
| 0.05 | 786 | |
| 0.06 | 734 | |
| 0.07 | 710 | |
| 0.08 | 765 | |
| 0.09 | 778 |
| Value | Count | Frequency (%) |
| 0.25 | 1 | < 0.1% |
| 0.21 | 1 | < 0.1% |
| 0.17 | 1 | < 0.1% |
| 0.16 | 1 | < 0.1% |
| 0.11 | 1 | < 0.1% |
| 0.1 | 745 | |
| 0.09 | 778 | |
| 0.08 | 765 | |
| 0.07 | 710 | |
| 0.06 | 734 |
number_of_records
Categorical
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.7 KiB |
| 1 |
|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8399 |
|---|---|
| Distinct characters | 1 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 8399 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 8399 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 8399 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8399 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 8399 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8399 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 8399 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8399 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 8399 |
order_date
Date
| Distinct | 1418 |
|---|---|
| Distinct (%) | 16.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.7 KiB |
| Minimum | 2012-01-01 00:00:00 |
|---|---|
| Maximum | 2015-12-30 00:00:00 |
order_id
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 5496 |
|---|---|
| Distinct (%) | 65.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29965.18 |
| Minimum | 3 |
|---|---|
| Maximum | 59973 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 65.7 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 2818 |
| Q1 | 15011.5 |
| median | 29857 |
| Q3 | 44596 |
| 95-th percentile | 57061 |
| Maximum | 59973 |
| Range | 59970 |
| Interquartile range (IQR) | 29584.5 |
Descriptive statistics
| Standard deviation | 17260.883 |
|---|---|
| Coefficient of variation (CV) | 0.57603137 |
| Kurtosis | -1.1783167 |
| Mean | 29965.18 |
| Median Absolute Deviation (MAD) | 14778 |
| Skewness | 0.0038108922 |
| Sum | 2.5167754 × 108 |
| Variance | 2.979381 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 43745 | 6 | 0.1% |
| 24132 | 6 | 0.1% |
| 43875 | 5 | 0.1% |
| 57253 | 5 | 0.1% |
| 59781 | 5 | 0.1% |
| 12067 | 5 | 0.1% |
| 52896 | 5 | 0.1% |
| 33797 | 5 | 0.1% |
| 43488 | 5 | 0.1% |
| 8995 | 5 | 0.1% |
| Other values (5486) | 8347 |
| Value | Count | Frequency (%) |
| 3 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 32 | 4 | |
| 35 | 2 | |
| 36 | 1 | < 0.1% |
| 65 | 1 | < 0.1% |
| 66 | 1 | < 0.1% |
| 69 | 2 | |
| 70 | 2 | |
| 96 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 59973 | 2 | |
| 59971 | 3 | |
| 59969 | 2 | |
| 59943 | 1 | < 0.1% |
| 59942 | 1 | < 0.1% |
| 59939 | 1 | < 0.1% |
| 59937 | 1 | < 0.1% |
| 59911 | 1 | < 0.1% |
| 59909 | 2 | |
| 59906 | 1 | < 0.1% |
order_priority
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.7 KiB |
| High | |
|---|---|
| Low | |
| Not Specified | |
| Medium | |
| Critical |
Length
| Max length | 13 |
|---|---|
| Median length | 6 |
| Mean length | 6.7410406 |
| Min length | 3 |
Characters and Unicode
| Total characters | 56618 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | High |
|---|---|
| 2nd row | Not Specified |
| 3rd row | Low |
| 4th row | Critical |
| 5th row | Low |
Common Values
| Value | Count | Frequency (%) |
| High | 1768 | |
| Low | 1720 | |
| Not Specified | 1672 | |
| Medium | 1631 | |
| Critical | 1608 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| high | 1768 | |
| low | 1720 | |
| not | 1672 | |
| specified | 1672 | |
| medium | 1631 | |
| critical | 1608 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 9959 | |
| e | 4975 | 8.8% |
| o | 3392 | 6.0% |
| d | 3303 | 5.8% |
| t | 3280 | 5.8% |
| c | 3280 | 5.8% |
| H | 1768 | 3.1% |
| g | 1768 | 3.1% |
| h | 1768 | 3.1% |
| L | 1720 | 3.0% |
| Other values (13) | 21405 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 44875 | |
| Uppercase Letter | 10071 | 17.8% |
| Space Separator | 1672 | 3.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 9959 | |
| e | 4975 | |
| o | 3392 | 7.6% |
| d | 3303 | 7.4% |
| t | 3280 | 7.3% |
| c | 3280 | 7.3% |
| g | 1768 | 3.9% |
| h | 1768 | 3.9% |
| w | 1720 | 3.8% |
| f | 1672 | 3.7% |
| Other values (6) | 9758 |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 1768 | |
| L | 1720 | |
| S | 1672 | |
| N | 1672 | |
| M | 1631 | |
| C | 1608 |
Space Separator
| Value | Count | Frequency (%) |
| 1672 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 54946 | |
| Common | 1672 | 3.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 9959 | |
| e | 4975 | 9.1% |
| o | 3392 | 6.2% |
| d | 3303 | 6.0% |
| t | 3280 | 6.0% |
| c | 3280 | 6.0% |
| H | 1768 | 3.2% |
| g | 1768 | 3.2% |
| h | 1768 | 3.2% |
| L | 1720 | 3.1% |
| Other values (12) | 19733 |
Common
| Value | Count | Frequency (%) |
| 1672 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 56618 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 9959 | |
| e | 4975 | 8.8% |
| o | 3392 | 6.0% |
| d | 3303 | 5.8% |
| t | 3280 | 5.8% |
| c | 3280 | 5.8% |
| H | 1768 | 3.1% |
| g | 1768 | 3.1% |
| h | 1768 | 3.1% |
| L | 1720 | 3.0% |
| Other values (13) | 21405 |
order_quantity
Real number (ℝ)
| Distinct | 50 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25.571735 |
| Minimum | 1 |
|---|---|
| Maximum | 50 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 65.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 13 |
| median | 26 |
| Q3 | 38 |
| 95-th percentile | 48 |
| Maximum | 50 |
| Range | 49 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 14.481071 |
|---|---|
| Coefficient of variation (CV) | 0.56629209 |
| Kurtosis | -1.2080203 |
| Mean | 25.571735 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -0.017317782 |
| Sum | 214777 |
| Variance | 209.70142 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 31 | 202 | 2.4% |
| 4 | 196 | 2.3% |
| 39 | 195 | 2.3% |
| 46 | 193 | 2.3% |
| 23 | 192 | 2.3% |
| 24 | 192 | 2.3% |
| 3 | 189 | 2.3% |
| 42 | 189 | 2.3% |
| 43 | 184 | 2.2% |
| 41 | 183 | 2.2% |
| Other values (40) | 6484 |
| Value | Count | Frequency (%) |
| 1 | 165 | |
| 2 | 152 | |
| 3 | 189 | |
| 4 | 196 | |
| 5 | 166 | |
| 6 | 172 | |
| 7 | 174 | |
| 8 | 176 | |
| 9 | 155 | |
| 10 | 170 |
| Value | Count | Frequency (%) |
| 50 | 182 | |
| 49 | 136 | |
| 48 | 172 | |
| 47 | 166 | |
| 46 | 193 | |
| 45 | 163 | |
| 44 | 157 | |
| 43 | 184 | |
| 42 | 189 | |
| 41 | 183 |
product_base_margin
Real number (ℝ)
| Distinct | 51 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 63 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5125132 |
| Minimum | 0.35 |
|---|---|
| Maximum | 0.85 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 65.7 KiB |
Quantile statistics
| Minimum | 0.35 |
|---|---|
| 5-th percentile | 0.36 |
| Q1 | 0.38 |
| median | 0.52 |
| Q3 | 0.59 |
| 95-th percentile | 0.78 |
| Maximum | 0.85 |
| Range | 0.5 |
| Interquartile range (IQR) | 0.21 |
Descriptive statistics
| Standard deviation | 0.13558894 |
|---|---|
| Coefficient of variation (CV) | 0.26455698 |
| Kurtosis | -0.66087023 |
| Mean | 0.5125132 |
| Median Absolute Deviation (MAD) | 0.12 |
| Skewness | 0.55939959 |
| Sum | 4272.31 |
| Variance | 0.018384361 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.37 | 761 | 9.1% |
| 0.38 | 678 | 8.1% |
| 0.36 | 628 | 7.5% |
| 0.59 | 497 | 5.9% |
| 0.39 | 482 | 5.7% |
| 0.56 | 459 | 5.5% |
| 0.57 | 459 | 5.5% |
| 0.4 | 408 | 4.9% |
| 0.58 | 387 | 4.6% |
| 0.55 | 314 | 3.7% |
| Other values (41) | 3263 |
| Value | Count | Frequency (%) |
| 0.35 | 262 | 3.1% |
| 0.36 | 628 | |
| 0.37 | 761 | |
| 0.38 | 678 | |
| 0.39 | 482 | |
| 0.4 | 408 | |
| 0.41 | 98 | 1.2% |
| 0.42 | 78 | 0.9% |
| 0.43 | 101 | 1.2% |
| 0.44 | 94 | 1.1% |
| Value | Count | Frequency (%) |
| 0.85 | 36 | |
| 0.84 | 25 | 0.3% |
| 0.83 | 83 | |
| 0.82 | 32 | 0.4% |
| 0.81 | 73 | |
| 0.8 | 48 | |
| 0.79 | 68 | |
| 0.78 | 89 | |
| 0.77 | 68 | |
| 0.76 | 55 |
product_category
Categorical
HIGH CORRELATION 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.7 KiB |
| Office Supplies | |
|---|---|
| Technology | |
| Furniture |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 12.539112 |
| Min length | 9 |
Characters and Unicode
| Total characters | 105316 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Office Supplies |
|---|---|
| 2nd row | Office Supplies |
| 3rd row | Furniture |
| 4th row | Office Supplies |
| 5th row | Furniture |
Common Values
| Value | Count | Frequency (%) |
| Office Supplies | 4610 | |
| Technology | 2065 | |
| Furniture | 1724 | 20.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| office | 4610 | |
| supplies | 4610 | |
| technology | 2065 | |
| furniture | 1724 | 13.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 13009 | |
| i | 10944 | 10.4% |
| p | 9220 | 8.8% |
| f | 9220 | 8.8% |
| u | 8058 | 7.7% |
| c | 6675 | 6.3% |
| l | 6675 | 6.3% |
| O | 4610 | 4.4% |
| s | 4610 | 4.4% |
| S | 4610 | 4.4% |
| Other values (10) | 27685 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 87697 | |
| Uppercase Letter | 13009 | 12.4% |
| Space Separator | 4610 | 4.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 13009 | |
| i | 10944 | |
| p | 9220 | |
| f | 9220 | |
| u | 8058 | |
| c | 6675 | |
| l | 6675 | |
| s | 4610 | 5.3% |
| o | 4130 | 4.7% |
| n | 3789 | 4.3% |
| Other values (5) | 11367 |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 4610 | |
| S | 4610 | |
| T | 2065 | |
| F | 1724 | 13.3% |
Space Separator
| Value | Count | Frequency (%) |
| 4610 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 100706 | |
| Common | 4610 | 4.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 13009 | |
| i | 10944 | |
| p | 9220 | 9.2% |
| f | 9220 | 9.2% |
| u | 8058 | 8.0% |
| c | 6675 | 6.6% |
| l | 6675 | 6.6% |
| O | 4610 | 4.6% |
| s | 4610 | 4.6% |
| S | 4610 | 4.6% |
| Other values (9) | 23075 |
Common
| Value | Count | Frequency (%) |
| 4610 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 105316 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 13009 | |
| i | 10944 | 10.4% |
| p | 9220 | 8.8% |
| f | 9220 | 8.8% |
| u | 8058 | 7.7% |
| c | 6675 | 6.3% |
| l | 6675 | 6.3% |
| O | 4610 | 4.4% |
| s | 4610 | 4.4% |
| S | 4610 | 4.4% |
| Other values (10) | 27685 |
product_container
Categorical
HIGH CORRELATION 
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.7 KiB |
| Small Box | |
|---|---|
| Wrap Bag | |
| Small Pack | |
| Jumbo Drum | |
| Jumbo Box | |
| Other values (2) |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 9.0926301 |
| Min length | 8 |
Characters and Unicode
| Total characters | 76369 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Small Box |
|---|---|
| 2nd row | Large Box |
| 3rd row | Jumbo Drum |
| 4th row | Wrap Bag |
| 5th row | Jumbo Drum |
Common Values
| Value | Count | Frequency (%) |
| Small Box | 4347 | |
| Wrap Bag | 1168 | 13.9% |
| Small Pack | 956 | 11.4% |
| Jumbo Drum | 624 | 7.4% |
| Jumbo Box | 532 | 6.3% |
| Large Box | 406 | 4.8% |
| Medium Box | 366 | 4.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| box | 5651 | |
| small | 5303 | |
| wrap | 1168 | 7.0% |
| bag | 1168 | 7.0% |
| jumbo | 1156 | 6.9% |
| pack | 956 | 5.7% |
| drum | 624 | 3.7% |
| large | 406 | 2.4% |
| medium | 366 | 2.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 10606 | |
| a | 9001 | |
| 8399 | ||
| m | 7449 | |
| B | 6819 | |
| o | 6807 | |
| x | 5651 | |
| S | 5303 | |
| r | 2198 | 2.9% |
| u | 2146 | 2.8% |
| Other values (14) | 11990 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 51172 | |
| Uppercase Letter | 16798 | 22.0% |
| Space Separator | 8399 | 11.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 10606 | |
| a | 9001 | |
| m | 7449 | |
| o | 6807 | |
| x | 5651 | |
| r | 2198 | 4.3% |
| u | 2146 | 4.2% |
| g | 1574 | 3.1% |
| p | 1168 | 2.3% |
| b | 1156 | 2.3% |
| Other values (5) | 3416 | 6.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 6819 | |
| S | 5303 | |
| W | 1168 | 7.0% |
| J | 1156 | 6.9% |
| P | 956 | 5.7% |
| D | 624 | 3.7% |
| L | 406 | 2.4% |
| M | 366 | 2.2% |
Space Separator
| Value | Count | Frequency (%) |
| 8399 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 67970 | |
| Common | 8399 | 11.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 10606 | |
| a | 9001 | |
| m | 7449 | |
| B | 6819 | |
| o | 6807 | |
| x | 5651 | |
| S | 5303 | |
| r | 2198 | 3.2% |
| u | 2146 | 3.2% |
| g | 1574 | 2.3% |
| Other values (13) | 10416 |
Common
| Value | Count | Frequency (%) |
| 8399 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 76369 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 10606 | |
| a | 9001 | |
| 8399 | ||
| m | 7449 | |
| B | 6819 | |
| o | 6807 | |
| x | 5651 | |
| S | 5303 | |
| r | 2198 | 2.9% |
| u | 2146 | 2.8% |
| Other values (14) | 11990 |
product_name
Text
| Distinct | 1263 |
|---|---|
| Distinct (%) | 15.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.7 KiB |
Length
| Max length | 98 |
|---|---|
| Median length | 75 |
| Mean length | 34.351709 |
| Min length | 3 |
Characters and Unicode
| Total characters | 288520 |
|---|---|
| Distinct characters | 84 |
| Distinct categories | 12 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 4 ? |
Unique
| Unique | 57 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | Perma STOR-ALL™ Hanging File Box, 13 1/8"W x 12 1/4"D x 10 1/2"H |
|---|---|
| 2nd row | Safco Industrial Wire Shelving |
| 3rd row | Hon 4070 Series Pagoda™ Armless Upholstered Stacking Chairs |
| 4th row | White GlueTop Scratch Pads |
| 5th row | Hon Valutask™ Swivel Chairs |
| Value | Count | Frequency (%) |
| xerox | 765 | 1.8% |
| x | 499 | 1.2% |
| avery | 418 | 1.0% |
| with | 405 | 0.9% |
| black | 338 | 0.8% |
| 327 | 0.8% | |
| binders | 305 | 0.7% |
| for | 302 | 0.7% |
| chair | 276 | 0.6% |
| keyboard | 268 | 0.6% |
| Other values (2076) | 38861 |
Most occurring characters
| Value | Count | Frequency (%) |
| 34365 | 11.9% | |
| e | 25862 | 9.0% |
| r | 15875 | 5.5% |
| o | 15517 | 5.4% |
| a | 14627 | 5.1% |
| i | 14001 | 4.9% |
| l | 12489 | 4.3% |
| t | 12482 | 4.3% |
| n | 11792 | 4.1% |
| s | 11438 | 4.0% |
| Other values (74) | 120072 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 184442 | |
| Uppercase Letter | 42833 | 14.8% |
| Space Separator | 34365 | 11.9% |
| Decimal Number | 16531 | 5.7% |
| Other Punctuation | 6372 | 2.2% |
| Dash Punctuation | 2329 | 0.8% |
| Other Symbol | 1374 | 0.5% |
| Final Punctuation | 69 | < 0.1% |
| Open Punctuation | 68 | < 0.1% |
| Close Punctuation | 68 | < 0.1% |
| Other values (2) | 69 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 25862 | |
| r | 15875 | 8.6% |
| o | 15517 | 8.4% |
| a | 14627 | 7.9% |
| i | 14001 | 7.6% |
| l | 12489 | 6.8% |
| t | 12482 | 6.8% |
| n | 11792 | 6.4% |
| s | 11438 | 6.2% |
| c | 7441 | 4.0% |
| Other values (17) | 42918 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 4518 | 10.5% |
| C | 4434 | 10.4% |
| P | 4013 | 9.4% |
| B | 3859 | 9.0% |
| D | 2625 | 6.1% |
| A | 2478 | 5.8% |
| M | 2403 | 5.6% |
| F | 2020 | 4.7% |
| T | 1985 | 4.6% |
| R | 1664 | 3.9% |
| Other values (16) | 12834 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3209 | |
| 0 | 2724 | |
| 2 | 1984 | |
| 3 | 1528 | |
| 8 | 1410 | |
| 4 | 1406 | |
| 9 | 1255 | 7.6% |
| 5 | 1142 | 6.9% |
| 6 | 943 | 5.7% |
| 7 | 930 | 5.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2856 | |
| / | 1359 | |
| " | 1060 | 16.6% |
| . | 520 | 8.2% |
| & | 216 | 3.4% |
| ' | 149 | 2.3% |
| # | 104 | 1.6% |
| * | 59 | 0.9% |
| % | 45 | 0.7% |
| ; | 4 | 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ® | 896 | |
| ™ | 478 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 53 | |
| [ | 15 | 22.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 53 | |
| ] | 15 | 22.1% |
Space Separator
| Value | Count | Frequency (%) |
| 34365 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2329 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 69 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 40 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 29 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 227275 | |
| Common | 61245 | 21.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 25862 | 11.4% |
| r | 15875 | 7.0% |
| o | 15517 | 6.8% |
| a | 14627 | 6.4% |
| i | 14001 | 6.2% |
| l | 12489 | 5.5% |
| t | 12482 | 5.5% |
| n | 11792 | 5.2% |
| s | 11438 | 5.0% |
| c | 7441 | 3.3% |
| Other values (43) | 85751 |
Common
| Value | Count | Frequency (%) |
| 34365 | ||
| 1 | 3209 | 5.2% |
| , | 2856 | 4.7% |
| 0 | 2724 | 4.4% |
| - | 2329 | 3.8% |
| 2 | 1984 | 3.2% |
| 3 | 1528 | 2.5% |
| 8 | 1410 | 2.3% |
| 4 | 1406 | 2.3% |
| / | 1359 | 2.2% |
| Other values (21) | 8075 | 13.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 287046 | |
| None | 898 | 0.3% |
| Letterlike Symbols | 478 | 0.2% |
| Punctuation | 98 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 34365 | 12.0% | |
| e | 25862 | 9.0% |
| r | 15875 | 5.5% |
| o | 15517 | 5.4% |
| a | 14627 | 5.1% |
| i | 14001 | 4.9% |
| l | 12489 | 4.4% |
| t | 12482 | 4.3% |
| n | 11792 | 4.1% |
| s | 11438 | 4.0% |
| Other values (69) | 118598 |
None
| Value | Count | Frequency (%) |
| ® | 896 | |
| à | 2 | 0.2% |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 478 |
Punctuation
| Value | Count | Frequency (%) |
| ” | 69 | |
| “ | 29 |
product_sub_category
Categorical
HIGH CORRELATION 
| Distinct | 17 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.7 KiB |
| Paper | |
|---|---|
| Binders and Binder Accessories | |
| Telephones and Communication | |
| Office Furnishings | |
| Computer Peripherals | |
| Other values (12) |
Length
| Max length | 30 |
|---|---|
| Median length | 20 |
| Mean length | 17.080962 |
| Min length | 5 |
Characters and Unicode
| Total characters | 143463 |
|---|---|
| Distinct characters | 37 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Storage & Organization |
|---|---|
| 2nd row | Storage & Organization |
| 3rd row | Chairs & Chairmats |
| 4th row | Paper |
| 5th row | Chairs & Chairmats |
Common Values
| Value | Count | Frequency (%) |
| Paper | 1225 | |
| Binders and Binder Accessories | 915 | |
| Telephones and Communication | 883 | |
| Office Furnishings | 788 | |
| Computer Peripherals | 758 | |
| Pens & Art Supplies | 633 | |
| Storage & Organization | 546 | 6.5% |
| Appliances | 434 | 5.2% |
| Chairs & Chairmats | 386 | 4.6% |
| Tables | 361 | 4.3% |
| Other values (7) | 1470 |
Length
| Value | Count | Frequency (%) |
| and | 2029 | 10.5% |
| 1565 | 8.1% | |
| paper | 1225 | 6.3% |
| office | 1125 | 5.8% |
| binders | 915 | 4.7% |
| binder | 915 | 4.7% |
| accessories | 915 | 4.7% |
| telephones | 883 | 4.6% |
| communication | 883 | 4.6% |
| furnishings | 788 | 4.1% |
| Other values (22) | 8098 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 15400 | 10.7% |
| s | 11945 | 8.3% |
| i | 11613 | 8.1% |
| n | 11005 | 7.7% |
| 10942 | 7.6% | |
| r | 10371 | 7.2% |
| a | 9566 | 6.7% |
| o | 6269 | 4.4% |
| p | 6091 | 4.2% |
| c | 4942 | 3.4% |
| Other values (27) | 45319 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 115065 | |
| Uppercase Letter | 15747 | 11.0% |
| Space Separator | 10942 | 7.6% |
| Other Punctuation | 1709 | 1.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 15400 | |
| s | 11945 | |
| i | 11613 | |
| n | 11005 | |
| r | 10371 | |
| a | 9566 | |
| o | 6269 | 5.4% |
| p | 6091 | 5.3% |
| c | 4942 | 4.3% |
| d | 4038 | 3.5% |
| Other values (12) | 23825 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 2616 | |
| C | 2500 | |
| B | 2198 | |
| A | 1982 | |
| O | 1671 | |
| T | 1388 | |
| S | 1323 | |
| F | 875 | 5.6% |
| M | 337 | 2.1% |
| R | 323 | 2.1% |
| Other values (2) | 534 | 3.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 1565 | |
| , | 144 | 8.4% |
Space Separator
| Value | Count | Frequency (%) |
| 10942 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 130812 | |
| Common | 12651 | 8.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 15400 | 11.8% |
| s | 11945 | 9.1% |
| i | 11613 | 8.9% |
| n | 11005 | 8.4% |
| r | 10371 | 7.9% |
| a | 9566 | 7.3% |
| o | 6269 | 4.8% |
| p | 6091 | 4.7% |
| c | 4942 | 3.8% |
| d | 4038 | 3.1% |
| Other values (24) | 39572 |
Common
| Value | Count | Frequency (%) |
| 10942 | ||
| & | 1565 | 12.4% |
| , | 144 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 143463 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 15400 | 10.7% |
| s | 11945 | 8.3% |
| i | 11613 | 8.1% |
| n | 11005 | 7.7% |
| 10942 | 7.6% | |
| r | 10371 | 7.2% |
| a | 9566 | 6.7% |
| o | 6269 | 4.4% |
| p | 6091 | 4.2% |
| c | 4942 | 3.4% |
| Other values (27) | 45319 |
profit
Real number (ℝ)
| Distinct | 7967 |
|---|---|
| Distinct (%) | 94.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 181.18442 |
| Minimum | -14140.702 |
|---|---|
| Maximum | 27220.69 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 4264 |
| Negative (%) | 50.8% |
| Memory size | 65.7 KiB |
Quantile statistics
| Minimum | -14140.702 |
|---|---|
| 5-th percentile | -592.43905 |
| Q1 | -83.315 |
| median | -1.5 |
| Q3 | 162.748 |
| 95-th percentile | 1542.309 |
| Maximum | 27220.69 |
| Range | 41361.392 |
| Interquartile range (IQR) | 246.063 |
Descriptive statistics
| Standard deviation | 1196.6533 |
|---|---|
| Coefficient of variation (CV) | 6.6046149 |
| Kurtosis | 67.34971 |
| Mean | 181.18442 |
| Median Absolute Deviation (MAD) | 104.3345 |
| Skewness | 3.6472388 |
| Sum | 1521768 |
| Variance | 1431979.2 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -969.048366 | 8 | 0.1% |
| -433.290143 | 6 | 0.1% |
| -505.984479 | 5 | 0.1% |
| -715.778206 | 5 | 0.1% |
| -1331.553366 | 5 | 0.1% |
| -528.653125 | 5 | 0.1% |
| 11.65095 | 5 | 0.1% |
| -66.87 | 4 | < 0.1% |
| -22.82 | 4 | < 0.1% |
| -513.79042 | 4 | < 0.1% |
| Other values (7957) | 8348 |
| Value | Count | Frequency (%) |
| -14140.7016 | 1 | |
| -12557.9976 | 1 | |
| -11984.3979 | 1 | |
| -11861.46 | 1 | |
| -11769.17 | 1 | |
| -11053.6 | 1 | |
| -10263.6597 | 1 | |
| -9611.91 | 1 | |
| -9078.94 | 1 | |
| -8570.4483 | 1 |
| Value | Count | Frequency (%) |
| 27220.69 | 1 | |
| 14440.39 | 1 | |
| 13340.26 | 1 | |
| 12748.86 | 1 | |
| 12606.81 | 1 | |
| 11984.395 | 1 | |
| 11630.146 | 1 | |
| 11562.08 | 1 | |
| 11535.282 | 1 | |
| 10951.3065 | 1 |
region
Categorical
HIGH CORRELATION 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.7 KiB |
| Central | |
|---|---|
| West | |
| East | |
| South |
Length
| Max length | 7 |
|---|---|
| Median length | 5 |
| Mean length | 5.186808 |
| Min length | 4 |
Characters and Unicode
| Total characters | 43564 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | East |
|---|---|
| 2nd row | East |
| 3rd row | West |
| 4th row | West |
| 5th row | West |
Common Values
| Value | Count | Frequency (%) |
| Central | 2710 | |
| West | 1956 | |
| East | 1895 | |
| South | 1838 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| central | 2710 | |
| west | 1956 | |
| east | 1895 | |
| south | 1838 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 8399 | |
| e | 4666 | |
| a | 4605 | |
| s | 3851 | |
| C | 2710 | 6.2% |
| n | 2710 | 6.2% |
| r | 2710 | 6.2% |
| l | 2710 | 6.2% |
| W | 1956 | 4.5% |
| E | 1895 | 4.3% |
| Other values (4) | 7352 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 35165 | |
| Uppercase Letter | 8399 | 19.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 8399 | |
| e | 4666 | |
| a | 4605 | |
| s | 3851 | |
| n | 2710 | 7.7% |
| r | 2710 | 7.7% |
| l | 2710 | 7.7% |
| o | 1838 | 5.2% |
| u | 1838 | 5.2% |
| h | 1838 | 5.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2710 | |
| W | 1956 | |
| E | 1895 | |
| S | 1838 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 43564 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 8399 | |
| e | 4666 | |
| a | 4605 | |
| s | 3851 | |
| C | 2710 | 6.2% |
| n | 2710 | 6.2% |
| r | 2710 | 6.2% |
| l | 2710 | 6.2% |
| W | 1956 | 4.5% |
| E | 1895 | 4.3% |
| Other values (4) | 7352 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 43564 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 8399 | |
| e | 4666 | |
| a | 4605 | |
| s | 3851 | |
| C | 2710 | 6.2% |
| n | 2710 | 6.2% |
| r | 2710 | 6.2% |
| l | 2710 | 6.2% |
| W | 1956 | 4.5% |
| E | 1895 | 4.3% |
| Other values (4) | 7352 |
row_id
Real number (ℝ)
HIGH CORRELATION  UNIFORM  UNIQUE 
| Distinct | 8399 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4200 |
| Minimum | 1 |
|---|---|
| Maximum | 8399 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 65.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 420.9 |
| Q1 | 2100.5 |
| median | 4200 |
| Q3 | 6299.5 |
| 95-th percentile | 7979.1 |
| Maximum | 8399 |
| Range | 8398 |
| Interquartile range (IQR) | 4199 |
Descriptive statistics
| Standard deviation | 2424.7268 |
|---|---|
| Coefficient of variation (CV) | 0.5773159 |
| Kurtosis | -1.2 |
| Mean | 4200 |
| Median Absolute Deviation (MAD) | 2100 |
| Skewness | 0 |
| Sum | 35275800 |
| Variance | 5879300 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4031 | 1 | < 0.1% |
| 671 | 1 | < 0.1% |
| 669 | 1 | < 0.1% |
| 672 | 1 | < 0.1% |
| 4860 | 1 | < 0.1% |
| 883 | 1 | < 0.1% |
| 5148 | 1 | < 0.1% |
| 1554 | 1 | < 0.1% |
| 5426 | 1 | < 0.1% |
| 6973 | 1 | < 0.1% |
| Other values (8389) | 8389 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 8399 | 1 | |
| 8398 | 1 | |
| 8397 | 1 | |
| 8396 | 1 | |
| 8395 | 1 | |
| 8394 | 1 | |
| 8393 | 1 | |
| 8392 | 1 | |
| 8391 | 1 | |
| 8390 | 1 |
sales
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 8153 |
|---|---|
| Distinct (%) | 97.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1775.8782 |
| Minimum | 2.24 |
|---|---|
| Maximum | 89061.05 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 65.7 KiB |
Quantile statistics
| Minimum | 2.24 |
|---|---|
| 5-th percentile | 34.178 |
| Q1 | 143.195 |
| median | 449.42 |
| Q3 | 1709.32 |
| 95-th percentile | 7844.335 |
| Maximum | 89061.05 |
| Range | 89058.81 |
| Interquartile range (IQR) | 1566.125 |
Descriptive statistics
| Standard deviation | 3585.0505 |
|---|---|
| Coefficient of variation (CV) | 2.018748 |
| Kurtosis | 60.928376 |
| Mean | 1775.8782 |
| Median Absolute Deviation (MAD) | 381.95 |
| Skewness | 5.3869824 |
| Sum | 14915601 |
| Variance | 12852587 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 75.19 | 3 | < 0.1% |
| 19.36 | 3 | < 0.1% |
| 20.19 | 3 | < 0.1% |
| 74.02 | 3 | < 0.1% |
| 10.48 | 3 | < 0.1% |
| 224.58 | 3 | < 0.1% |
| 43.29 | 3 | < 0.1% |
| 151.19 | 3 | < 0.1% |
| 127.56 | 3 | < 0.1% |
| 115.81 | 3 | < 0.1% |
| Other values (8143) | 8369 |
| Value | Count | Frequency (%) |
| 2.24 | 1 | |
| 3.2 | 1 | |
| 3.23 | 1 | |
| 3.41 | 1 | |
| 3.42 | 1 | |
| 3.63 | 1 | |
| 3.77 | 1 | |
| 3.85 | 1 | |
| 3.96 | 1 | |
| 4.94 | 1 |
| Value | Count | Frequency (%) |
| 89061.05 | 1 | |
| 45923.76 | 1 | |
| 41343.21 | 1 | |
| 33367.85 | 1 | |
| 29884.6 | 1 | |
| 29345.27 | 1 | |
| 29186.49 | 1 | |
| 28761.52 | 1 | |
| 28664.52 | 1 | |
| 28389.14 | 1 |
ship_date
Date
| Distinct | 1450 |
|---|---|
| Distinct (%) | 17.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.7 KiB |
| Minimum | 2012-01-02 00:00:00 |
|---|---|
| Maximum | 2015-12-30 00:00:00 |
ship_mode
Categorical
HIGH CORRELATION 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.7 KiB |
| Regular Air | |
|---|---|
| Delivery Truck | |
| Express Air |
Length
| Max length | 14 |
|---|---|
| Median length | 11 |
| Mean length | 11.409334 |
| Min length | 11 |
Characters and Unicode
| Total characters | 95827 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Regular Air |
|---|---|
| 2nd row | Express Air |
| 3rd row | Delivery Truck |
| 4th row | Regular Air |
| 5th row | Delivery Truck |
Common Values
| Value | Count | Frequency (%) |
| Regular Air | 6270 | |
| Delivery Truck | 1146 | 13.6% |
| Express Air | 983 | 11.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| air | 7253 | |
| regular | 6270 | |
| delivery | 1146 | 6.8% |
| truck | 1146 | 6.8% |
| express | 983 | 5.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 16798 | |
| e | 9545 | |
| 8399 | ||
| i | 8399 | |
| u | 7416 | |
| l | 7416 | |
| A | 7253 | |
| R | 6270 | 6.5% |
| g | 6270 | 6.5% |
| a | 6270 | 6.5% |
| Other values (10) | 11791 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 70630 | |
| Uppercase Letter | 16798 | 17.5% |
| Space Separator | 8399 | 8.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 16798 | |
| e | 9545 | |
| i | 8399 | |
| u | 7416 | |
| l | 7416 | |
| g | 6270 | 8.9% |
| a | 6270 | 8.9% |
| s | 1966 | 2.8% |
| v | 1146 | 1.6% |
| y | 1146 | 1.6% |
| Other values (4) | 4258 | 6.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 7253 | |
| R | 6270 | |
| T | 1146 | 6.8% |
| D | 1146 | 6.8% |
| E | 983 | 5.9% |
Space Separator
| Value | Count | Frequency (%) |
| 8399 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 87428 | |
| Common | 8399 | 8.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 16798 | |
| e | 9545 | |
| i | 8399 | |
| u | 7416 | |
| l | 7416 | |
| A | 7253 | |
| R | 6270 | 7.2% |
| g | 6270 | 7.2% |
| a | 6270 | 7.2% |
| s | 1966 | 2.2% |
| Other values (9) | 9825 |
Common
| Value | Count | Frequency (%) |
| 8399 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 95827 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 16798 | |
| e | 9545 | |
| 8399 | ||
| i | 8399 | |
| u | 7416 | |
| l | 7416 | |
| A | 7253 | |
| R | 6270 | 6.5% |
| g | 6270 | 6.5% |
| a | 6270 | 6.5% |
| Other values (10) | 11791 |
shipping_cost
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 652 |
|---|---|
| Distinct (%) | 7.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.838557 |
| Minimum | 0.49 |
|---|---|
| Maximum | 164.73 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 65.7 KiB |
Quantile statistics
| Minimum | 0.49 |
|---|---|
| 5-th percentile | 0.8 |
| Q1 | 3.3 |
| median | 6.07 |
| Q3 | 13.99 |
| 95-th percentile | 55.351 |
| Maximum | 164.73 |
| Range | 164.24 |
| Interquartile range (IQR) | 10.69 |
Descriptive statistics
| Standard deviation | 17.264052 |
|---|---|
| Coefficient of variation (CV) | 1.3447035 |
| Kurtosis | 7.7515872 |
| Mean | 12.838557 |
| Median Absolute Deviation (MAD) | 3.61 |
| Skewness | 2.5538008 |
| Sum | 107831.04 |
| Variance | 298.04749 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 19.99 | 352 | 4.2% |
| 8.99 | 321 | 3.8% |
| 1.99 | 247 | 2.9% |
| 0.5 | 190 | 2.3% |
| 0.99 | 144 | 1.7% |
| 4 | 143 | 1.7% |
| 1.49 | 138 | 1.6% |
| 0.7 | 138 | 1.6% |
| 24.49 | 132 | 1.6% |
| 2.99 | 124 | 1.5% |
| Other values (642) | 6470 |
| Value | Count | Frequency (%) |
| 0.49 | 34 | 0.4% |
| 0.5 | 190 | |
| 0.7 | 138 | |
| 0.71 | 22 | 0.3% |
| 0.73 | 1 | < 0.1% |
| 0.75 | 7 | 0.1% |
| 0.76 | 7 | 0.1% |
| 0.78 | 7 | 0.1% |
| 0.79 | 3 | < 0.1% |
| 0.8 | 24 | 0.3% |
| Value | Count | Frequency (%) |
| 164.73 | 1 | < 0.1% |
| 154.12 | 1 | < 0.1% |
| 147.12 | 2 | < 0.1% |
| 143.71 | 1 | < 0.1% |
| 130 | 1 | < 0.1% |
| 126 | 1 | < 0.1% |
| 110.2 | 10 | |
| 99 | 7 | |
| 91.05 | 5 | 0.1% |
| 89.3 | 13 |
state
Categorical
HIGH CORRELATION 
| Distinct | 48 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.7 KiB |
| California | |
|---|---|
| Texas | |
| Illinois | 500 |
| Florida | 479 |
| Ohio | 396 |
| Other values (43) |
Length
| Max length | 14 |
|---|---|
| Median length | 12 |
| Mean length | 7.7852125 |
| Min length | 2 |
Characters and Unicode
| Total characters | 65388 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Pennsylvania |
|---|---|
| 2nd row | Maryland |
| 3rd row | California |
| 4th row | California |
| 5th row | California |
Common Values
| Value | Count | Frequency (%) |
| California | 780 | 9.3% |
| Texas | 577 | 6.9% |
| Illinois | 500 | 6.0% |
| Florida | 479 | 5.7% |
| Ohio | 396 | 4.7% |
| New York | 372 | 4.4% |
| Michigan | 291 | 3.5% |
| Indiana | 241 | 2.9% |
| Washington | 240 | 2.9% |
| Minnesota | 239 | 2.8% |
| Other values (38) | 4284 |
Length
| Value | Count | Frequency (%) |
| california | 780 | 8.2% |
| new | 687 | 7.2% |
| texas | 577 | 6.1% |
| illinois | 500 | 5.2% |
| florida | 479 | 5.0% |
| ohio | 396 | 4.2% |
| york | 372 | 3.9% |
| carolina | 316 | 3.3% |
| michigan | 291 | 3.1% |
| north | 245 | 2.6% |
| Other values (40) | 4884 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8410 | |
| i | 7507 | 11.5% |
| n | 6295 | 9.6% |
| o | 5725 | 8.8% |
| e | 3794 | 5.8% |
| r | 3778 | 5.8% |
| s | 3729 | 5.7% |
| l | 3413 | 5.2% |
| h | 1743 | 2.7% |
| t | 1465 | 2.2% |
| Other values (36) | 19529 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 54422 | |
| Uppercase Letter | 9838 | 15.0% |
| Space Separator | 1128 | 1.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8410 | |
| i | 7507 | |
| n | 6295 | |
| o | 5725 | |
| e | 3794 | |
| r | 3778 | |
| s | 3729 | |
| l | 3413 | 6.3% |
| h | 1743 | 3.2% |
| t | 1465 | 2.7% |
| Other values (14) | 8563 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1358 | |
| C | 1355 | |
| N | 1052 | |
| I | 1031 | |
| O | 829 | |
| T | 743 | |
| A | 532 | 5.4% |
| F | 479 | 4.9% |
| W | 473 | 4.8% |
| Y | 372 | 3.8% |
| Other values (11) | 1614 |
Space Separator
| Value | Count | Frequency (%) |
| 1128 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 64260 | |
| Common | 1128 | 1.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8410 | |
| i | 7507 | |
| n | 6295 | 9.8% |
| o | 5725 | 8.9% |
| e | 3794 | 5.9% |
| r | 3778 | 5.9% |
| s | 3729 | 5.8% |
| l | 3413 | 5.3% |
| h | 1743 | 2.7% |
| t | 1465 | 2.3% |
| Other values (35) | 18401 |
Common
| Value | Count | Frequency (%) |
| 1128 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 65388 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8410 | |
| i | 7507 | 11.5% |
| n | 6295 | 9.6% |
| o | 5725 | 8.8% |
| e | 3794 | 5.8% |
| r | 3778 | 5.8% |
| s | 3729 | 5.7% |
| l | 3413 | 5.2% |
| h | 1743 | 2.7% |
| t | 1465 | 2.2% |
| Other values (36) | 19529 |
unit_price
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 751 |
|---|---|
| Distinct (%) | 8.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 89.346259 |
| Minimum | 0.99 |
|---|---|
| Maximum | 6783.02 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 65.7 KiB |
Quantile statistics
| Minimum | 0.99 |
|---|---|
| 5-th percentile | 2.88 |
| Q1 | 6.48 |
| median | 20.99 |
| Q3 | 85.99 |
| 95-th percentile | 320.64 |
| Maximum | 6783.02 |
| Range | 6782.03 |
| Interquartile range (IQR) | 79.51 |
Descriptive statistics
| Standard deviation | 290.35438 |
|---|---|
| Coefficient of variation (CV) | 3.2497654 |
| Kurtosis | 271.16873 |
| Mean | 89.346259 |
| Median Absolute Deviation (MAD) | 17.01 |
| Skewness | 14.127793 |
| Sum | 750419.23 |
| Variance | 84305.668 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.48 | 264 | 3.1% |
| 65.99 | 192 | 2.3% |
| 4.98 | 136 | 1.6% |
| 125.99 | 115 | 1.4% |
| 5.98 | 102 | 1.2% |
| 2.88 | 81 | 1.0% |
| 20.99 | 73 | 0.9% |
| 30.98 | 73 | 0.9% |
| 35.99 | 70 | 0.8% |
| 205.99 | 66 | 0.8% |
| Other values (741) | 7227 |
| Value | Count | Frequency (%) |
| 0.99 | 2 | < 0.1% |
| 1.14 | 10 | 0.1% |
| 1.26 | 13 | |
| 1.48 | 12 | |
| 1.6 | 5 | 0.1% |
| 1.68 | 22 | |
| 1.7 | 8 | 0.1% |
| 1.74 | 9 | 0.1% |
| 1.76 | 29 | |
| 1.8 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 6783.02 | 7 | |
| 3502.14 | 6 | |
| 3499.99 | 7 | |
| 2550.14 | 7 | |
| 2036.48 | 6 | |
| 1938.02 | 8 | |
| 1889.99 | 3 | < 0.1% |
| 1637.53 | 2 | < 0.1% |
| 1500.97 | 5 | |
| 1360.14 | 3 | < 0.1% |
zip_code
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 1626 |
|---|---|
| Distinct (%) | 19.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 52839.139 |
| Minimum | 1001 |
|---|---|
| Maximum | 99362 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 65.7 KiB |
Quantile statistics
| Minimum | 1001 |
|---|---|
| 5-th percentile | 6032.7 |
| Q1 | 30337 |
| median | 52732 |
| Q3 | 77577 |
| 95-th percentile | 95992.2 |
| Maximum | 99362 |
| Range | 98361 |
| Interquartile range (IQR) | 47240 |
Descriptive statistics
| Standard deviation | 28509.536 |
|---|---|
| Coefficient of variation (CV) | 0.53955337 |
| Kurtosis | -1.1269582 |
| Mean | 52839.139 |
| Median Absolute Deviation (MAD) | 23321 |
| Skewness | -0.055889262 |
| Sum | 4.4379593 × 108 |
| Variance | 8.1279362 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 94110 | 22 | 0.3% |
| 92277 | 22 | 0.3% |
| 88201 | 21 | 0.3% |
| 81301 | 20 | 0.2% |
| 55372 | 19 | 0.2% |
| 59715 | 19 | 0.2% |
| 87105 | 18 | 0.2% |
| 46203 | 17 | 0.2% |
| 4401 | 17 | 0.2% |
| 21222 | 15 | 0.2% |
| Other values (1616) | 8209 |
| Value | Count | Frequency (%) |
| 1001 | 1 | |
| 1007 | 1 | |
| 1013 | 1 | |
| 1027 | 1 | |
| 1028 | 1 | |
| 1040 | 1 | |
| 1056 | 1 | |
| 1060 | 1 | |
| 1069 | 1 | |
| 1075 | 2 |
| Value | Count | Frequency (%) |
| 99362 | 8 | |
| 99352 | 5 | |
| 99336 | 6 | |
| 99301 | 6 | |
| 99207 | 7 | |
| 99163 | 7 | |
| 98902 | 3 | < 0.1% |
| 98801 | 7 | |
| 98661 | 8 | |
| 98632 | 6 |
| customer_age | discount | order_id | order_quantity | product_base_margin | profit | row_id | sales | shipping_cost | unit_price | zip_code | customer_segment | order_priority | product_category | product_container | product_sub_category | region | ship_mode | state | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| customer_age | 1.000 | 0.017 | 0.016 | 0.019 | -0.018 | 0.012 | 0.016 | 0.009 | -0.002 | 0.000 | 0.003 | 0.010 | 0.037 | 0.000 | 0.000 | 0.000 | 0.013 | 0.018 | 0.062 |
| discount | 0.017 | 1.000 | -0.002 | -0.009 | -0.001 | -0.073 | -0.002 | -0.022 | -0.003 | -0.001 | -0.008 | 0.005 | 0.015 | 0.027 | 0.023 | 0.000 | 0.000 | 0.000 | 0.000 |
| order_id | 0.016 | -0.002 | 1.000 | 0.011 | -0.020 | 0.009 | 1.000 | -0.011 | -0.004 | -0.018 | -0.005 | 0.027 | 0.041 | 0.000 | 0.001 | 0.000 | 0.041 | 0.000 | 0.065 |
| order_quantity | 0.019 | -0.009 | 0.011 | 1.000 | 0.011 | 0.238 | 0.011 | 0.399 | -0.024 | -0.030 | 0.005 | 0.000 | 0.014 | 0.000 | 0.000 | 0.017 | 0.012 | 0.000 | 0.018 |
| product_base_margin | -0.018 | -0.001 | -0.020 | 0.011 | 1.000 | -0.207 | -0.020 | 0.349 | 0.291 | 0.392 | -0.006 | 0.016 | 0.014 | 0.442 | 0.291 | 0.422 | 0.016 | 0.336 | 0.017 |
| profit | 0.012 | -0.073 | 0.009 | 0.238 | -0.207 | 1.000 | 0.009 | 0.325 | -0.193 | 0.227 | 0.013 | 0.000 | 0.021 | 0.103 | 0.131 | 0.168 | 0.021 | 0.149 | 0.017 |
| row_id | 0.016 | -0.002 | 1.000 | 0.011 | -0.020 | 0.009 | 1.000 | -0.011 | -0.004 | -0.018 | -0.005 | 0.030 | 0.039 | 0.000 | 0.000 | 0.000 | 0.041 | 0.000 | 0.063 |
| sales | 0.009 | -0.022 | -0.011 | 0.399 | 0.349 | 0.325 | -0.011 | 1.000 | 0.587 | 0.877 | -0.000 | 0.000 | 0.000 | 0.112 | 0.149 | 0.197 | 0.000 | 0.214 | 0.000 |
| shipping_cost | -0.002 | -0.003 | -0.004 | -0.024 | 0.291 | -0.193 | -0.004 | 0.587 | 1.000 | 0.652 | -0.008 | 0.000 | 0.020 | 0.363 | 0.374 | 0.306 | 0.019 | 0.518 | 0.000 |
| unit_price | 0.000 | -0.001 | -0.018 | -0.030 | 0.392 | 0.227 | -0.018 | 0.877 | 0.652 | 1.000 | -0.007 | 0.011 | 0.012 | 0.094 | 0.121 | 0.197 | 0.021 | 0.088 | 0.000 |
| zip_code | 0.003 | -0.008 | -0.005 | 0.005 | -0.006 | 0.013 | -0.005 | -0.000 | -0.008 | -0.007 | 1.000 | 0.117 | 0.036 | 0.000 | 0.014 | 0.000 | 0.885 | 0.000 | 0.970 |
| customer_segment | 0.010 | 0.005 | 0.027 | 0.000 | 0.016 | 0.000 | 0.030 | 0.000 | 0.000 | 0.011 | 0.117 | 1.000 | 0.000 | 0.006 | 0.000 | 0.000 | 0.067 | 0.000 | 0.201 |
| order_priority | 0.037 | 0.015 | 0.041 | 0.014 | 0.014 | 0.021 | 0.039 | 0.000 | 0.020 | 0.012 | 0.036 | 0.000 | 1.000 | 0.000 | 0.005 | 0.000 | 0.003 | 0.003 | 0.064 |
| product_category | 0.000 | 0.027 | 0.000 | 0.000 | 0.442 | 0.103 | 0.000 | 0.112 | 0.363 | 0.094 | 0.000 | 0.006 | 0.000 | 1.000 | 0.491 | 0.999 | 0.019 | 0.382 | 0.019 |
| product_container | 0.000 | 0.023 | 0.001 | 0.000 | 0.291 | 0.131 | 0.000 | 0.149 | 0.374 | 0.121 | 0.014 | 0.000 | 0.005 | 0.491 | 1.000 | 0.654 | 0.032 | 0.703 | 0.026 |
| product_sub_category | 0.000 | 0.000 | 0.000 | 0.017 | 0.422 | 0.168 | 0.000 | 0.197 | 0.306 | 0.197 | 0.000 | 0.000 | 0.000 | 0.999 | 0.654 | 1.000 | 0.041 | 0.611 | 0.005 |
| region | 0.013 | 0.000 | 0.041 | 0.012 | 0.016 | 0.021 | 0.041 | 0.000 | 0.019 | 0.021 | 0.885 | 0.067 | 0.003 | 0.019 | 0.032 | 0.041 | 1.000 | 0.011 | 0.997 |
| ship_mode | 0.018 | 0.000 | 0.000 | 0.000 | 0.336 | 0.149 | 0.000 | 0.214 | 0.518 | 0.088 | 0.000 | 0.000 | 0.003 | 0.382 | 0.703 | 0.611 | 0.011 | 1.000 | 0.033 |
| state | 0.062 | 0.000 | 0.065 | 0.018 | 0.017 | 0.017 | 0.063 | 0.000 | 0.000 | 0.000 | 0.970 | 0.201 | 0.064 | 0.019 | 0.026 | 0.005 | 0.997 | 0.033 | 1.000 |
| city | customer_age | customer_name | customer_segment | discount | number_of_records | order_date | order_id | order_priority | order_quantity | product_base_margin | product_category | product_container | product_name | product_sub_category | profit | region | row_id | sales | ship_date | ship_mode | shipping_cost | state | unit_price | zip_code | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | McKeesport | NaN | Jessica Myrick | Small Business | 0.10 | 1 | 2012-01-01 | 28774 | High | 32 | 0.68 | Office Supplies | Small Box | Perma STOR-ALL™ Hanging File Box, 13 1/8"W x 12 1/4"D x 10 1/2"H | Storage & Organization | -111.800 | East | 4031 | 180.36 | 2012-01-02 | Regular Air | 4.69 | Pennsylvania | 5.98 | 15131 |
| 1 | Bowie | NaN | Matt Collister | Home Office | 0.08 | 1 | 2012-01-01 | 13729 | Not Specified | 9 | NaN | Office Supplies | Large Box | Safco Industrial Wire Shelving | Storage & Organization | -342.910 | East | 1914 | 872.48 | 2012-01-03 | Express Air | 35.00 | Maryland | 95.99 | 20715 |
| 2 | Napa | NaN | Alan Schoenberger | Corporate | 0.00 | 1 | 2012-01-02 | 37537 | Low | 4 | 0.56 | Furniture | Jumbo Drum | Hon 4070 Series Pagoda™ Armless Upholstered Stacking Chairs | Chairs & Chairmats | -193.080 | West | 5272 | 1239.06 | 2012-01-02 | Delivery Truck | 48.80 | California | 291.73 | 94559 |
| 3 | Montebello | NaN | Elizabeth Moffitt | Consumer | 0.08 | 1 | 2012-01-02 | 44069 | Critical | 43 | 0.39 | Office Supplies | Wrap Bag | White GlueTop Scratch Pads | Paper | 247.790 | West | 6225 | 614.80 | 2012-01-02 | Regular Air | 1.97 | California | 15.04 | 90640 |
| 4 | Napa | NaN | Alan Schoenberger | Corporate | 0.07 | 1 | 2012-01-02 | 37537 | Low | 43 | 0.69 | Furniture | Jumbo Drum | Hon Valutask™ Swivel Chairs | Chairs & Chairmats | -1049.850 | West | 5273 | 4083.19 | 2012-01-04 | Delivery Truck | 45.00 | California | 100.98 | 94559 |
| 5 | Montebello | NaN | Elizabeth Moffitt | Consumer | 0.09 | 1 | 2012-01-02 | 44069 | Critical | 16 | 0.40 | Office Supplies | Wrap Bag | Black Print Carbonless Snap-Off® Rapid Letter, 8 1/2" x 7" | Paper | 26.710 | West | 6224 | 137.63 | 2012-01-04 | Express Air | 2.15 | California | 9.11 | 90640 |
| 6 | Prior Lake | NaN | David Philippe | Consumer | 0.06 | 1 | 2012-01-02 | 9285 | Critical | 3 | 0.36 | Office Supplies | Small Box | Avery Trapezoid Ring Binder, 3" Capacity, Black, 1040 sheets | Binders and Binder Accessories | -11.937 | Central | 1279 | 124.81 | 2012-01-04 | Regular Air | 2.99 | Minnesota | 40.98 | 55372 |
| 7 | Napa | NaN | Alan Schoenberger | Corporate | 0.05 | 1 | 2012-01-02 | 37537 | Low | 32 | 0.59 | Office Supplies | Small Box | Dual Level, Single-Width Filing Carts | Storage & Organization | 1438.490 | West | 5274 | 4902.38 | 2012-01-09 | Regular Air | 7.07 | California | 155.06 | 94559 |
| 8 | Phenix City | NaN | Patrick Jones | Home Office | 0.09 | 1 | 2012-01-03 | 40354 | High | 4 | 0.64 | Furniture | Jumbo Box | Bush Advantage Collection® Round Conference Table | Tables | -93.160 | South | 5705 | 698.00 | 2012-01-04 | Delivery Truck | 52.20 | Alabama | 212.60 | 36869 |
| 9 | Draper | NaN | Larry Tron | Home Office | 0.05 | 1 | 2012-01-03 | 9762 | High | 12 | 0.78 | Furniture | Medium Box | 36X48 HARDFLOOR CHAIRMAT | Office Furnishings | -146.050 | West | 1336 | 262.76 | 2012-01-04 | Regular Air | 21.20 | Utah | 20.98 | 84020 |
| city | customer_age | customer_name | customer_segment | discount | number_of_records | order_date | order_id | order_priority | order_quantity | product_base_margin | product_category | product_container | product_name | product_sub_category | profit | region | row_id | sales | ship_date | ship_mode | shipping_cost | state | unit_price | zip_code | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 8389 | Scarsdale | 95.0 | Shirley Schmidt | Corporate | 0.05 | 1 | 2015-12-29 | 53730 | High | 40 | 0.36 | Office Supplies | Small Box | Avery® Durable Plastic 1" Binders | Binders and Binder Accessories | -144.739 | East | 7524 | 181.80 | 2015-12-30 | Regular Air | 5.83 | New York | 4.54 | 10583 |
| 8390 | Olathe | 88.0 | Anna Andreadi | Small Business | 0.09 | 1 | 2015-12-29 | 13507 | Medium | 27 | 0.39 | Office Supplies | Small Box | Strathmore Photo Mount Cards | Paper | -75.710 | Central | 1876 | 176.10 | 2015-12-30 | Regular Air | 6.18 | Kansas | 6.78 | 66062 |
| 8391 | Horn Lake | 88.0 | Jennifer Jackson | Home Office | 0.10 | 1 | 2015-12-29 | 29216 | Critical | 46 | 0.64 | Technology | Small Box | Fellowes Mobile Numeric Keypad, Graphite | Computer Peripherals | 307.170 | South | 4100 | 1936.45 | 2015-12-30 | Regular Air | 4.00 | Mississippi | 43.22 | 38637 |
| 8392 | Fairfield | 95.0 | Tony Molinari | Corporate | 0.06 | 1 | 2015-12-30 | 50950 | Not Specified | 6 | 0.70 | Furniture | Jumbo Drum | Novimex Fabric Task Chair | Chairs & Chairmats | -166.960 | West | 7141 | 391.12 | 2015-12-30 | Delivery Truck | 30.00 | California | 60.98 | 94533 |
| 8393 | Charlottesville | 95.0 | Jim Epp | Small Business | 0.08 | 1 | 2015-12-30 | 47815 | Not Specified | 45 | 0.54 | Furniture | Wrap Bag | DAX Wood Document Frame. | Office Furnishings | -33.470 | South | 6712 | 580.96 | 2015-12-30 | Regular Air | 6.85 | Virginia | 13.73 | 22901 |
| 8394 | Fairfield | 95.0 | Tony Molinari | Corporate | 0.10 | 1 | 2015-12-30 | 50950 | Not Specified | 35 | 0.59 | Office Supplies | Small Box | Tenex Personal Project File with Scoop Front Design, Black | Storage & Organization | -15.070 | West | 7142 | 448.10 | 2015-12-30 | Express Air | 4.51 | California | 13.48 | 94533 |
| 8395 | Harker Heights | 95.0 | Matt Hagelstein | Home Office | 0.09 | 1 | 2015-12-30 | 25542 | Low | 37 | 0.39 | Office Supplies | Wrap Bag | Black Print Carbonless 8 1/2" x 8 1/4" Rapid Memo Book | Paper | -18.660 | Central | 3583 | 257.46 | 2015-12-30 | Express Air | 4.23 | Texas | 7.28 | 76543 |
| 8396 | Riverview | 95.0 | Theresa Swint | Consumer | 0.10 | 1 | 2015-12-30 | 45127 | Medium | 10 | 0.37 | Office Supplies | Wrap Bag | Binder Clips by OIC | Rubber Bands | -1.290 | South | 6361 | 14.15 | 2015-12-30 | Regular Air | 0.70 | Florida | 1.48 | 33569 |
| 8397 | Nicholasville | 95.0 | Maribeth Yedwab | Home Office | 0.09 | 1 | 2015-12-30 | 49344 | Low | 1 | 0.83 | Office Supplies | Medium Box | Martin Yale Chadless Opener Electric Letter Opener | Scissors, Rulers and Trimmers | -745.200 | South | 6916 | 803.33 | 2015-12-30 | Regular Air | 24.49 | Kentucky | 832.81 | 40356 |
| 8398 | Nicholasville | 95.0 | Maribeth Yedwab | Home Office | 0.00 | 1 | 2015-12-30 | 49344 | Low | 31 | 0.68 | Technology | Small Box | Belkin 105-Key Black Keyboard | Computer Peripherals | 27.850 | South | 6915 | 672.93 | 2015-12-30 | Regular Air | 4.00 | Kentucky | 19.98 | 40356 |